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Several alternatives to manage large XML document collections exist, ranging from file 
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system designed from scratch for storing and processing XML data. Contrary to the common 
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New computing paradigms for network-aware applications need specification languages able 
to deal with the features of mobile code-based systems. A coordination language provides a 
formal framework in which the interaction of active entities can be expressed. A coordination 
language deals with the creation and destruction of code or complex agents, their 
communication activites, as well as their distribution and mobility in space. We show how 
the coordination language PoliS offers a flexible ... 
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We have developed a Web based multimedia educational system, which has been designed 
to provide support for both synchronous and asynchronous distance learning activities. On 
the one hand, the system provides students with asynchronous educational material under 
the different form of HTML slides, digital audio/video lectures, software packages and 
simulating instruments for laboratory activities. On the other hand, the system allows users 
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In this paper, we present a mobile streaming media CDN (Content Delivery Network) 
architecture in which content segmentation, request routing, pre-fetch scheduling, and 
session handoff are controlled by SMIL (Synchronized Multimedia Integrated Language) 
modification. In this architecture, mobile clients simply follow modified SMIL files 
downloaded from a streaming portal server; these modifications enable multimedia content 
to be delivered to the mobile clients from the best surrogates in the CD ... 
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This paper presents a novel editor supporting interactive refinement in the development of 
structured documents. The user performs a sequence of editing operations on the document 
view, and the editor automatically derives an efficient and reliable document source and a 
transformation that produces the document view. The editor is unique in its 
programmability, in the sense that the transformation can be obtained through editing 
operations. The main tricks behind are the utilization of the view- ... 
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From a methodological point of view vertical migration involves the following four steps: 
identification of migration objects, prediction of expected system improvements, 
implementation, and verification of the results. In this paper a firmware monitor is 
presented as a support tool for the first and fourth step. The application environment for this 
monitor is a PDP-11/60 with writable control store running the UNIX operating system. 
Based upon a UNIX system model the requirement ... 
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tools, designing the XML content and the transformation sheets is a tedious and error prone 
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ABSTRACT 

This paper presents a model for describing the synchro- 
nization between several media delivered over a network in 
a Web-based environment. Synchronization concerns the 
download and the activation of coordinated media files ac- 
cording to the structure of the whole hypermedia document, 
the playback status of the media objects, and the user in- 
teraction. The model defines five synchronization primitives 
some of which can be automatically deducted from the doc- 
ument structure and the media-related events. The model 
is oriented to describe applications based on a main video 
or audio narration to which static or dynamic documents 
are attached. The model is suited for applications like dis- 
tance education, professional training, Web advertising and 
selling, and news-on-demand. 

General Terms 

Hypermedia, World Wide Web, media delivery, synchroniza- 
tion, interactive presentation 

L INTRODUCTION 

In this paper we present a synchronization model for navi- 
gating hypermedia documents in a distributed environment 
like the World Wide Web. The model is targeted to a class 
of applications where one or more continuous media files 
(video or audio streams) are presented to the user and, as 
streams play, other documents (text and images) are sent 
to the browser and displayed in synchrony with them. The 
user can interact with the presentation by pausing and re- 
suming it, by moving forward or backward, and by following 
hyperlinks that lead him/her to a different place or time in 
the same hypermedia document or to a different document. 
We use the term of "video-centered" hypermedia to denote 
such documents. 

Even if the model does not cover efficiently all hypermedia 
document types its reference context is wide, and includes 
applications like self and distance education, professional 

Permission to make digital or hard copies of all or pari of this 
work for personal or classroom use is granted without fee provided 
that copies are not made or distributed for profit or commercial 
advantage and that copies bear this notice and the full citation on 
the first page. To copy otherwise, to republish, to post on servers 
or to redistribute U> lists, requires prior specific permission and or 
fee. 

SACOO March 19-21 Como, Italy 

(c) 2000 ACM 1-581 1 3-239-5/00/003 >...>$3.00 



training, Web advertising and selling, and news-on-demand. 
As an example, in a Web application for estate selling a 
video could show the property and, as details are displayed, 
text pages describe the main features. From time to time 
maps and facade prospects are shown, giving the user the 
possibility to explore specific issues in a flexible way. An- 
other example is the remote delivery of a lesson composed 
of the teacher's lecture video or audio recording that illus- 
trates the issues discussed in the classroom, together with 
the set of slides which are shown at proper times during the 
lecture playback. 

Well refer mainly to distance education with some exam- 
pies to illustrate our work. As a general framework, we 
assume that a lesson is delivered to the user as a hyperme- 
dia document composed of several modules hierarchically 
organized. Each module is composed of a continuous me- 
dia stream (e.g., an audio stream) which, as it plays, makes 
texts, images and possibly other continuous media docu- 
ments to be displayed to the user screen. The user may 
adapt the pace of the lesson to his/her needs by moving in 
the module or across the modules in a controlled way: e.g., 
by skipping some parts, playing again other parts, temporar- 
ily suspending the lesson to browse supplemental material, 
and so on. The system re-synchronizes (i.e., delivers to the 
browser) the different media in order to preserve the consis- 
tency of the whole lesson. 

The paper is organized as follows: in Section 2 a model for 
video-centered hypermedia documents is illustrated. Sec- 
tion 3 describes the primitives for synchronized delivery of 
different media and basic user interaction. Section 4 briefly 
discusses additional synchronization issues due to user in- 
teraction. Section 5 comments the related literature, and 
Section 6 presents some final remarks. 

2. A MODEL OF VIDEO-CENTERED HY- 
PERMEDIA DOCUMENTS 

Hypermedia documents are modeled along two directions, 
one describing the hierarchical structure of the document 
components, the other describing the synchronization prim- 
itives and the delivery dynamics. 

A hypermedia document is generally composed of a set of 
modules, which the user can address through some table 
of contents. For example, a multimedia course is divided 
into lessons, which can be indexed through a syllabus or 
a time schedule; a news-on-demand application delivers in- 
dependent articles selected from an index; an estate selling 
application lets the user choose the relevant property from 
a catalogue, and so on. 
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Figure 1: Structure of video-centered hypermedia document 

Coining back to the distance education scenario, we assume 
that from the presentation dynamics point of view each les- 
son is completely autonomous and self-contained. We do 
not elaborate further on this level of access since it is not 
relevant for delivery and synchronization issues, and assume 
a module, whatever be its definition, as the topmost level of 
aggregation of media components which is relevant for our 
discussion. 

Borrowing the terminology from document structuring, we 
call chapter such a module. A chapter is divided into sec- 
tions: a section is a multimedia document which is normally 
played without requiring user intervention, while moving 
from a section to the next one can be automatic or not de- 
pending on the document purpose and application context. 
Sections could be addressable from an index as chapters are, 
but still we shall not enter into such details. 
As a dynamic document, a section is composed of continu- 
ous media clips and static pages of text and images. Section 
playback is ruled by the continuous media, with static pages 
delivered and displayed at specific narration points. A sec- 
tion is thus divided into smaller components that correspond 
to the time intervals between two narration points in which 
one or more static documents must be delivered to the user. 
If we analyze the structure from a dynamic point of view, 
we can better refer to a terminology borrowed from movies 
context. The continuous media presentation which consti- 
tutes the main chapter content is a story, which is made 
of a sequence of clips 1 , each of which corresponds to a sec- 
tion of the chapter. Clips are divided into scenes, each of 
which is associated to a different static document, e.g., a 
text page or an image, which is displayed during the scene 
playback. The term narration introduced above denotes 
the playback of the continuous components of a hypermedia 
document. Figure 1 pictorially shows this structure. More 
complex structures with nested multimedia components will 
be introduced later. 

2.1 Logical vs. physical structure 

The structure illustrated in figure 1 is a logical structure, 
which is in principle unrelated to the physical organization of 
the media objects. A video file could correspond to a whole 
clip, to a scene within it, or to a sequence of clips that can 
be played continuously or as independent segments. Indeed, 
the correspondence between logical and physical organiza- 
tion should be hidden in an implementation layer, without 
influencing the model organization. In practice this cannot 
be done without paying a price in terms of performance and 
manageability even in a local environment (e.g., a CD-ROM 
based multimedia presentation). 

! The reason for such a name will be clear later. 



We distinguish between two kinds of media objects: contin- 
uous media files and document files like texts and images. 
Continuous media files are video and audio recordings, usu- 
ally called videoclips or andioclips. Texts and images are 
contained in static documents usually called pages in the 
WWW parlance 2 . 

A correspondence between the logical and the physical struc- 
ture exists: 

• a clip, i.e., the narrative content of a section, is a video 
or audio file which is assumed to be played continu- 
ously from beginning to end, unless user interaction 
modifies this behaviour; 

• a static document is a file which is supported and dis- 
played as a whole by the browser, both directly or 
through a plug-in or helper application. We implic- 
itly refer to HTML documents using the generic term 
of page, without entering in further detail about other 
formats (XML, PDF, etc.). 

From the above assumption it comes that one logical clip 
(i.e., a section) is implemented by one media file. Scenes 
are defined by time intervals in the clip: they must be con- 
tiguous and are played one after the other. Also pages are 
bound to specific contents associated to a scene. We can 
thus say that a scene with its associated page is the small- 
est amount of information that is delivered as a whole, and 
a clip and its associated set of pages are the smallest self- 
consistent and autonomous information with respect to the 
application domain. 

A page must be associated with a scene in order to be dis- 
played. According to the synchronization primitives that we 
shall discuss later, playing a scene causes the corresponding 
page to be displayed. Conversely, accessing a page via a 
hyperlink causes the corresponding scene to be played. Co- 
herently with the class of applications we want to model, 
a master-slave relationship is established between the dy- 
namic and the static components of a hypermedia document: 
the dynamic components control the hypermedia document 
playback, and the static components are synchronized ac- 
cordingly. We shall come back to this matter later. 
The model can also be applied to the synchronization of 
more than two media, for example a video, a background 
music, and a set of pages. A master-slave relationship is 
establish not only between the video and the text pages, 
but also between the video and the music track. In general, 
one of the media is defined as the master medium for the 
whole presentation, and the other are played according to 
the events denned and raised by the master. 

2.2 Channels 

Media objects require proper channels to be displayed or 
played. Channels are basically windows or frames on the 
user screen 3 . Several compatible media objects may share 
in different times the same channel; the model must describe 
how channels are reserved, used ad released during hyper- 
media documents delivery. 

2 We do not consider text documents with embedded dy- 
namic components like, e.g., video sequences, lb be precise, 
we do not consider them as "dynamic" with reference to the 
control of the whole hypermedia delivery or playback. 
3 Audio channels do not use windows, but must be handled 
the same ray. 
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A channel is a (virtual) display or playback device like a 
window, a browser frame, an audio channel or an media 
player, that can be used by one media at a time. The model 
does not enter into details about channels properties, the 
only relevant properties being an identifier that uniquely 
identifies it on the user workstation, and a type that defines 
the media types that can use it. 

Channels are defined and associated to media objects at 
design phase, and defaults are established consistently with 
the WWW environment (e.g., the main browser window is 
the default visual channel for all the media). 
A channel is busy if an active media is using it, otherwise 
it is free, free channels may still hold some content: for 
example at the end of a movie playback the movie window 
can still display the first or the last frame. 

3. SYNCHRONIZATION AND MEDIA DE- 
LIVERY RELATIONSHIPS 

If the media objects are the elements of a hypermedia doc- 
ument, the synchronization primitives are the rules that de- 
scribe their behaviour. A distributed hypermedia document 
is composed of several media objects that interact based on 
events raised by their execution. An event is the occurrence 
of a change in an object state which affects the presentation, 
e.g. the start of a movie clip, the download of a text page, 
the end of an audio track, or a user command that moves 
to another movie frame or activates a hyperlink. In order to 
keep the model as simple as possible, we define only three 
events: the activation of a media object, the deactivation 
due to normal playback termination, and the forced deac- 
tivation due to external causes like a user action or a syn- 
chronization action with other objects. We do not consider 
events like windows resizing, or changes in the visualization 
properties since they do not modify the synchronization re- 
lationships among the document components. 
In order to define the synchronization relationships we must 
first define the concept of active media object with reference 
to a WWW-based environment. A continuous media object 
is active when it is playing at the user site, and becomes 
inactive when playback terminates, both naturally and by 
explicit user action 4 . 

For non continuous media the concept of active object is 
more conventional than substantial. For example, a text 
page is active while it is displayed; indeed, it should be con- 
sidered active only while the associated scene is playing, 
even if it is still displayed after the scene ends. In order to 
simplify the model we associate to a static object an un- 
limited time length, and give the synchronization primitives 
the role of removing it from the used channel when no longer 
needed. 

The synchronization primitives are defined as relationships 
between media objects. We define five synchronization prim- 
itives: 

• A activates B, denoted by A => B 

• A plays with B, denoted by A B 

4 The actual implementation with current WWW servers 
and browsers assumes that a streaming technology is used to 
deliver such media, instead of downloading and local play- 
ing, thus associating the active phase both to the object 
download and playback. 



• A is replaced by B, denoted by A *± B 

• A has priority over B, denoted by A X B 

• A is terminated with B, denoted by A 4 B 

which are detailed in this section and in the next one. Some 
of these relationships need to be explicitly defined by design 
when structuring the hypermedia document, while other can 
be automatically inferred. Globally, they define the synchro- 
nized behaviour of the whole hypermedia application during 
sequential play and user navigation. 

3.1 Basic synchronization primitives 

Two basic relationships must be defined for the hypermedia 
design phase. 

Definition 1. Let A and B be two continuous media ob- 
jects. We define U A activates B", written A =» B, the 
relationship such that object B is activated (i.e., played) 
immediately after deactivation of object A due to normal 
termination. If A and B are built on the same media (e.g. 
both are videos) B may occupy the same channel as A since 
it is free after A termination. 

Definition 2. Let A and B be two generic media objects. 
We define "A plays with B", written A <* B, the relation- 
ship such that the activation of A or B, i.e., the beginning 
of playback (for continuous media) or display (for static me- 
dia), causes the other object to be activated too. Each ob- 
ject occupies a different channel. 

The "plays with" relationship is not symmetric: there is a 
master-slave dependence between one object, say A, which 
plays by virtue of a user action or by synchronization with 
another object, and the other object, say B, which plays by 
virtue of the relationship. If A is put into active state, it 
puts B into active state too, and viceversa. 
The activation of the whole hypermedia document is always 
initiated by the user that accesses the starting point through 
an index. According to the structural model the starting 
point is a chapter or a section, therefore it is a story or 
a clip. If it is a story, the first clip of the story must be 
activated. The "plays with" relationship can therefore be 
automatically inferred (as a form of inheritance) between a 
story and its first clip, with the story acting as the master. 
Similarly, the same relationship is inferred between a clip 
and its first scene. 

The "activates" relationship => can be inferred between the 
scenes of a dip, since a clip is played continuously (in ab- 
sence of user interaction). As a result, only the relationships 
between a chapter and a story and between the clips of a 
story have to be defined explicitly. 

Synchronization between continuous and static documents 
(i.e., scenes and pages) must be defined explicitly at design 
phase, since in principle pages are not ordered: their order- 
ing comes as a result of scene ordering and scene- page rela- 
tionships. We introduce them on a working example used 
consistently in the remainder of the paper. The example 
models a small lesson in a self-learning application. 
The lesson is composed of one chapter, therefore of one story, 
composed by two video clips cl and c2 which must be played 
one after the other, and by five text documents (pages) pi 
through p5 which must be displayed during the clips play- 
back. Clip cl is composed of two scenes $cl and $c2 % whereas 
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Figure 2: A synchronization example 



clip c2 contains three scenes, *c3, sc4 and scb. Each scene 
must be associated to a different text page. 
Once accessed by the user (through some index), the chapter 
makes the story to become active. Since we are still at 
the beginning of the hypermedia document delivery, this 
action is not a true synchronization action. A "plays with" 
<=► relationship exists between the story and its first clip 
cl, and between it and scene scl. Both relationships are 
automatically inferred by the story-clip-scene hierarchical 
structure. Scene scl activation must in synchrony display 
the first page of the accompanying text pi. Similarly, $c2 
p2, ... , scb <=> p5. Finally, since the two clips of the story 
must be played one after the other, a relationship cl =» c2 
must be explicitly defined. 

Figure 2 pictorially shows these relationships. The figure 
contains also other relationships (;=±) needed to further spec- 
ify the dynamic behaviour, which are automatically inferred 
by the lesson structure and will be introduced in next sec- 
tion. 

3.2 Media delivery and channel synchroniza- 
tion 

The relationships introduced in previous section describe 
also the media delivery requirements. Document compo- 
nents corresponding to physical media objects, i.e., sections 
(clips) and pages, must be sent to the user before being ac- 
tivated. From Figure 2 we can deduct that accessing the 
story requires the first clip to be activated. Since it is a 
file, it must be delivered (downloaded or streamed) before 
activation. When activated, its first scene is also activated, 
and this requires the first text page to be activated too. 
Therefore page pi must be downloaded to the user browser. 
Each media object knows which channel to use. If it is busy, 
the system can free the channel by deactivating the object 
that keeps it, or can create a new channel instance (e.g. a 
new window or audio track) for the media playback. The 
choice is of course not arbitrary, since it depends on the 
presentation meaning, therefore it must be described in the 
synchronization model. 

In our example the video presentation uses channel cnl (a 
browser frame) while the text pages are displayed in channel 
cu2 (another browser frame). When the first scene is played 
in channel cnl page pi is displayed in channel cn2. When 
scene scl terminates, playback continues with scene sc2 in 
the same channel. Since sc2 p2, page p2 must be acti- 
vated, but channel cn2 is still busy because page pi is active. 
We need therefore to introduce additional synchronization 
primitives to define channel deallocation. 

Definition 3. Let A and B be two objects of the same 
media type. We define "A is replaced by B n , written A ^ B, 



the relationship such that the activation of B forces A to 
go into an inactive state, i.e., to terminate its playback or 
display. B can occupy the channel released by .4. 

Figure 2 shows, the relationships pi ^ p2, p2 ^ p3, p3 ^ p4 
and p4 z=± p5 describe how text pages replace each other by 
sharing the same window. In order to keep the whole pre- 
sentation semantically consistent, replacement may happen 
only if the replaced document is associated to a scene no 
longer active, and can thus be moved to the inactive state 
without problems. For this reason the ^ relationship need 
not be defined at design phase; it can be automatically in- 
ferred from the other relationships. 

Referring to figure 2, given four objects ,4, B, C and D, 
whenever relationships A => B y A o C and B O D hold 
and C and D are compatible media types sharing the same 
channel, we can derive the relation C ^ D. In plain words, 
pages associated to sequential scenes are correctly assigned 
the output channel at right time. Other cases can be de- 
fined where additional synchronization relationship can be 
inferred from the ones defined at design phase, but we can- 
not examine them at deep extent in this paper. 

33 Following hyperlinks 

Text pages may embed references to other documents which 
are external to the current hypermedia document. If the 
user jumps, via a hyperlink, to another static document (say 
p6 from p3), it can be displayed in another window, or can 
use the same window used by current text document, re- 
placing it. To specify this behaviour we need to introduce a 
^ relationship between the origin and the destination of a 
hyperlink to a compatible object. This relationship can be 
inferred from the presence of a hyperlink*. 
By dynamically propagating the effect of this relationship, 
we can also infer p6 ;=* p4, which specifies the synchroniza- 
tion action to be performed at the end of scene sc3 play. 
The modeling of the lesson is complete. 
We resume the synchronization relationships defined, noting 
whether they are defined at design phase or inferred from 
other repationships. Figure 4 illustrates the whole set of 
relationships. The arcs labeled with the symbol U are de- 
scribed in Section 4. 

• story o cl for association between the story and clip, 
which is inferred from the document structure; 

• cl o scl for association between clip and its first scene 
that is inferred from the relation above; 

5 A different solution should be adopted if the hyperlink 
refers to the target document as a new window. 
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• scl pi, sc2 p2, sc3 p3 , sc4 p4 and sc5 O 
p5 for associations between scene and page that must 
be explicitly defined by the authors of the presentation; 

• scl => sc2, sc3 => sci sc4 => $cb for the scenes suc- 
ceeding that could be inferred from the clip- scene or- 
ganization; 

• cl => c2 for the clips succeeding that are defined by 
design; 

• sc2 => sc3 which is inferred (inherited) by the activate 
relationship between the two clips; 

• pi ^ p2, p2 ^ p3, p3 ^ p4, p4 ^ p5 for the page ter- 
mination that could be inferred by tracing the scene 
sequence, by observing that pages share the same win- 
dow. 

for a total of 16 relationships, 6 of which are defined by de- 
sign, while the remaining 10 can be automatically inferred 
from the document structure and the relationships implica- 
tions. 

3.4 Nested continuous media objects 

The user might need to switch temporarily to another mul- 
timedia document during the presentation. The contempo- 
rary presence of two (or more) continuous media presenta- 
tions which are not coordinated (like a video with an asso- 
ciated audio channel) cannot be correctly described by the 
synchronization primitives so far introduced. Paying atten- 
tion to two video or audio clips simultaneously is not pos- 
sible, and resource sharing cannot simply solved by brute 
force replacement of channel usage. We need to set up a hi- 
erarchy between two continuous media, to define how they 
take precedence over each other. 

Definition 4- Let A and B be two continuous media ob- 
jects. We define tt A has priority over B n , written A >- B, a 
relationship such that when object A becomes active while 
B is active too, B is moved to a suspended state, and play- 
back of A begins. B waits up to deactivation of object A, 
then goes back to active state resuming its playback. 
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Figure 3: Access to nested multimedia 

4. USER INTERACTION 

We introduce a few issues about synchronization require- 
ments due to user interaction, without entering into details 
that would require a much more thorough discussion, far 
beyond the size of this paper. 

When a multimedia presentation starts, its behaviour in ab- 
sence of user actions is defined by the synchronization prim- 
itives we have discussed, according to the document main 
structure. Any termination not caused by the natural end- 
ing of a continuous media can however bring the presenta- 
tion into an incorrect state. 

Let us suppose that the user stops a videoclip playback dur- 
ing a scene. Since the scene is interrupted, it does not ac- 
tivates the following scene. The last frame may still hold 
the clip window, but since the dip is no longer active it 
may be replaced by any new clip the user decides to start, 
e.g., by selecting another chapter or section from the doc- 
ument index. Since the page associated to the interrupted 
scene is still active, it also must be terminated in order to 
release the channel. Therefore, we need to define another 
synchronization primitive to correctly describe this case. 

Definition 5. Let A and B be two generic media objects. 
We define "A is terminated with B n , written A # B t the 
relationship such that the forced ending of object A causes 
the forced termination of B object. 



We note that the suspended state was not considered as a 
relevant state in our previous discussion. Indeed, this state 
does not cause any modification in the other media objects, 
that may however be influenced by the activation of the 
object with higher priority. 

As an example, let us suppose that a page, say p2, contains 
a hyperlink to a multimedia clip c3. While cl and c2 are 
the main thread of the presentation, c3 is a complementary 
section that the user may consult or not. The relationship 
c3 >- cl implies that if the user follows the hyperlink to 
c3, cl must be suspended in order to allow c3 to get the 
user attention. If c3 contains scenes to which text pages 
are associated, these will replace p2 as usual (if assigned 
to the same channel). Figure 3 illustrates this case. The 
relationship c3 X cl may also solve conflicts about the use 
of physical resources (like audio channels), but we do not 
enter into such details here. 



This relationship is automatically inferred for all the media 
objects between which a relationship of the type exists, 
as illustrated in figure 4, i.e., scl $ pi, sc2 ^ p2, *c3 U p3 , 
scA^p4 and sc5 ^ p5. 

The case when the user follows a reference to a different 
document has already been discussed in previous section. 
We can now point out that, with reference to figure 4, p6 
takes the place of p3 in the relationship p4 *± p3, therefore 
the forced termination of sc3 must terminate p6 instead of 
p3; the relationship sc2 U J>3 becomes sc2 ^ p6. This is 
a dynamically established relationship. In any subsequent 
access to the same document the second clip remains tied 
up to its first page and not to the external document. 
Jumps forward or backward in a continuous media may be 
managed in the same way: a jump is a stop at the actual 
playback point, and a start at the destination point. Enter- 
ing into details is however out of the size of this paper. 
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Figure 4: The complete synchronization scheme 



5. RELATED WORK 

Media synchronization in hypermedia documents has been 
largely investigated. The first serious attempt to introduce 
multimedia synchronization in hypertext document is the 
Amsterdam Hypermedia Model [1; 2], that distinguishes be- 
tween atomic and composite components. Synchronization 
inside composite components is made by synchronization 
arches that establish time relationships between two com- 
ponents or two anchors. 

In [4] Standard Reference Model for Intelliget Multimedia 
Presentation Systems is presented. It describes an imple- 
mentation of processes required for the generation of mul- 
timedia presentations, dividing them into well- defined lay- 
ers (control layer, content layer, design layer and realization 
layer). Each layer has a precise role, but the SRM lacks of a 
specific document model. In [3] we can find a first attempt 
to integrate the AHM with the SRM leading to a more pre- 
cise model without losing expressiveness. [3] describes in 
details, using AHM as a basis, the stages of generating a 
hypermedia presentation. 

Spatio-temporal relationships of a multimedia presentation 
are discussed in [5]. The paper defines a model based on 
temporal and spatial relationships between different multi- 
media objects that compose a presentation. Temporal and 
spatial start points are defined, and the objects are set us- 
ing topological and time relationships between them. The 
result is a formalism suitable for designing any multimedia 
composition. 

The work [6] proposes a multimedia document model con- 
taining temporal information that can be described using 
traditional document structures (e.g., SGML). The model 
contains the logical structure, the layout structure, and the 
temporal structure to formally describe multimedia objects. 
It pays more attention to user interactions; the authors 
propose an interactive synchronization based on use of to- 
kens for the control scheme to solve problems due to re- 
synchronization after any user interactions. 
PR£MO[7], Presentation Environment for Multimedia Ob- 
jects, primarily focuses on the presentation aspects of multi- 
media. PREMO aims at describing a standard programming 
environment in a very general way, through which media 
presentation can be constructed, processed and rendered as 
part of an overall application. The model can be divided 
into four components: each component is described in an 
object-oriented way. The model supplies an abstraction for 
the various multimedia virtual devices that may produce 



different multimedia data of a presentation. An object type 
and its subtypes set up time relationships. Two objects can 
be rendered sequentially, in a parallel way or in alternative 
according to presentation state. 

The work [8J presents a scheme for intra- and inter-stream 
synchronization of stored multimedia. It proposes a new 
protocol to ensure continuous and synchronous delivery of 
distributed stored multimedia objects across a network com- 
munity. Three models are described that assure synchro- 
nization even in environment with different delays, jitters, 
sever drop-outs, clock drifts and alternation of the average 
delay using a resynchronization protocol based on buffer 
level control. Unlike many other works, the models here 
defined does not require synchronized clocks on the network. 
In [9] a model for synchronization of a hypermedia applica- 
tion is described. It :s based on hypercharts, a notation that 
extends the statechart formalism in order to make it able to 
describe temporal constrains of a multimedia presentation. 
Two other works discuss issues very close to the one ap- 
proached by our model. In [10] a model for flexible pre- 
sentations is defined, called FLIPS (for FLexible Interactive 
Presentation Synchronization). Media objects have no pre- 
defined time length, but it can change according to user 
interactions. The synchronization scheme is based on two 
temporal relationships, called barriers and enablers. If event 
A is a barrier to event B then B is delayed until event A 
occurs. If event A is an en abler for event B when event A oc- 
curs event B also occurs as soon as it is barrier-free. FLIPS 
defines five object states: idle, ready, in-process, finished 
and complete. 

The main differences with our model concern the system en- 
vironment and the hypermedia dynamics modeling. FLIPS 
is not oriented to network based applications, and appears 
more oriented to multimedia presentations, rather than to 
hypermedia navigation. No structure is in fact provided, 
other than the ones coming from the objects mutual inter- 
relationships. Synchronization is defined between object's 
states and not between the objects themselves. Using barri- 
ers and enables, the start or end of an object cannot directly 
cause the start or end of another, but could only change the 
state of the object at hand. For example, its beginning 
(activation in our model) depends from the absence of bar- 
riers that can be set or reset by complex conditions. In our 
model the start or end of an object could be caused only by 
the user or by events associated to another object, indepen- 
dently from the state of other presentation's components. 
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SMIL[11], Synchronized Multimedia Integration Language, 
is a W3C recommendation defined as an XML application. 
It is a very simple markup language that defines tags for 
presenting multimedia objects in coordinated way. Synchro- 
nization is achieved through two tags: seq to render two or 
more objects one after the other and par to reproduce them 
at the same time. Using attributes it is possible to play only 
segments inside the time span of an object. The screen is 
divided into regions inside which it is possible to put multi- 
media objects that form a presentation. Differently from our 
model, however, SMIL does not consider user interactions. 
Moreover, it does not define a reference model for the data 
structure, but only defines tags for describing media object 
presentation. 

6. CONCLUSION 

In this paper we have presented a model for describing the 
synchronization between several media delivered over a net- 
work in a Web- based environment. Synchronization con- 
cerns the download and the activation of the several media 
files according to the structure of the whole hypermedia doc- 
ument, the playback status of the media objects, and the 
user interaction. We have introduced five synchronization 
primitives some of which must be defined at hypermedia 
design phase while others, describing the playback dynam- 
ics, can be deducted from the document structure and the 
media-related events. 

The model is oriented to describe applications based on a 
main video or audio narration to which static or dynamic 
documents are attached. The model is suited for applica- 
tions like distance education, Web advertising and selling, 
and news-on-demand. 

We have verified our modeTs potentialities using commercial 
streaming plug-ins for WWW browsers, and servers for de- 
livery of continuous media over a network. Early experience 
has shown that our model provides a much more sophis- 
ticated (even if simple) scheme than provided by available 
software (e.g., RealMedia servers and players). Future work 
will be oriented mainly to investigate implementation issues. 
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Markup languages provide efficient ways of storing information on the Web and make the 
retrieval process easier. Indeed, its simplicity and added intelligence to the document makes 
it useful for messaging between clients and server. 
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As users acquire or gain access to an increasingly diverse range of web access clients, web 
applications are adapting their user interfaces to support multiple modalities on multiple 
client types. User experiences can be enhanced by clients with differing capabilities 
combining to provide a distributed user interface to applications. Indeed, users will be 
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frustrated if their interaction with applications is limited to one client at a time.This paper 
discusses the requirements for coordinating we ... 
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This paper outlines the latest version of XISL (extensible Interaction Scenario Language). 
XISL is an XML-based markup language for web-based multimodal interaction systems. XISL 
enables to describe synchronization of multimodal inputs/outputs, dialog flow/transition, and 
some other descriptions required for multimodal interaction. XISL inherits these features 
from VoiceXML and SMIL. The original feature of XISL is that XISL has enough modality- 
extensibility. We present the basic XISL tags, outl ... 
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We have developed a Web based multimedia educational system, which has been designed 
to provide support for both synchronous and asynchronous distance learning activities. On 
the one hand, the system provides students with asynchronous educational material under 
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the different form of HTML slides, digital audio/video lectures, software packages and 
simulating instruments for laboratory activities. On the other hand, the system allows users 
(i.e., students and educators) to establish synchronous c ... 
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XML [15] has become a standard format for information that moves within the World Wide 
Web. Previous work in securing XML documents concentrated mainly on textual documents. 
Those proposals are ineffective in the context of multimedia, which mostly comprises of 
some sensible combination of images, text, audio, and video. As multimedia constitutes a 
significant component of the traffic within the Internet, it requires to be secured. We 
propose an access control model and an encryption mechanism t ... 
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To create an innovative interactive multimedia application, a multimedia designer needs to 
rapidly explore numerous behavioral design ideas early in the design process, as creating 
innovative behavior is the cornerstone of creating innovative multimedia. Current tools and 
techniques do not support a designers need for early behavior exploration, limiting her 
ability to rapidly explore and effectively communicate behavioral design ideas. To address 
this need, we have developed a sketch-based, in ... 
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Recent Standards specifications offer important but underemployed techniques to maximize 
access-to and distribution-of information for real-time 3D visualization over the web. This 
paper describes and evaluates such techniques to transform structured data such as 
Chemical Markup Language (CML) to different forms and contexts for Web3D delivery using 
Extensible Stylesheet Transformations (XSLT), Extensible 3D (X3D), and VRML97. Standards 
design approaches offer a number of advantages: data durabi ... 
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Multimedia hypertext has grown from the basic addition of dynamic media only at "leaf 1 
nodes of the hypertext, to higher structured attempts to compose and integrate the different 
media. One of the core problems in this evolution has been, and still is, the construction and 
perception of context, making explicit which part of a presentation is relevant when media 
elements are integrated. The search for contextualized integration of video material with 
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other sources of information has emerged fro ... 
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Web technologies change very rapidly. Even if the latest developments are not yet visible in 
our day-to-day use, founding technologies of tomorrow's Web can already be identified. The 
XML revolution opens the door to new perspectives, specifically for presenting information 
and for interacting with Web applications and services. This article analyzes the most recent 
work related to Web multimedia, support for different devices, new interaction modes, and 
some other trends that will significantly ... 
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Business models designed specifically for the mobile arena are reshaping the industry. But 
each model has unique advantages and disadvantages— potential adopters should take care 
in choosing among them. 
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This paper outlines a methodology for digital representation and preservation of Pulaar 
language data. 
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